Symbolic Nearest Mean Classifiers

نویسندگان

  • Piew Datta
  • Dennis F. Kibler
چکیده

The minimum-distance classifier summarizes each class with a prototype and then uses a nearest neighbor approach for classification. Three drawbacks of the original minimum-distance classifier are its inability to work with symbolic attributes, weigh attributes, and learn more than a single prototype for each class. The proposed solutions to these problems include defining the mean for symbolic attributes, providing a weighting metric, and learning several possible prototypes for each class. The learning algorithm developed to tackle these problems, SNMC, increases classification accuracy by 10% over the original minimum-distance classifier and has a higher average generalization accuracy than both C4.5 and PEBLS on 20 domains from the UC1 data repository.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MIREX 2005: Symbolic Genre classification with an ensemble of parametric and lazy classifiers

The symbolic genre classification algorithm submited to the MIREX (Music Information Retrieval Exchange) 2005 is described here. Our algorithm uses a combination of k-nearest neighbors and Bayesian classifiers trained with different sets of statistical descriptors extracted from melody tracks extracted from MIDI files. It is aimed at classifying melodies by genre. The statistical descriptors de...

متن کامل

From classifiers to discriminators: A nearest neighbor rule induced discriminant analysis

The current discriminant analysis method design is generally independent of classifiers, thus the connection between discriminant analysis methods and classifiers is loose. This paper provides a way to design discriminant analysis methods that are bound with classifiers. We begin with a local mean based nearest neighbor (LM-NN) classifier and use its decision rule to supervise the design of a d...

متن کامل

Feature Selection Approach in Animal Classification

In this paper, we propose a model for automatic classification of Animals using different classifiers Nearest Neighbour, Probabilistic Neural Network and Symbolic. Animal images are segmented using maximal region merging segmentation. The Gabor features are extracted from segmented animal images. Discriminative texture features are then selected using the different feature selection algorithm l...

متن کامل

Weighting Unusual Feature Types

Feature weighting is known empirically to improve classification accuracy for k-nearest neighbor classifiers in tasks with irrelevant features. Many feature weighting algorithms are designed to work with symbolic features, or numeric features, or both, but cannot be applied to problems with features that do not fit these categories. This paper presents a new k-nearest neighbor feature weighting...

متن کامل

Integration of Clinical and Gene Expression Data Has a Synergetic Effect on Predicting Breast Cancer Outcome

Breast cancer outcome can be predicted using models derived from gene expression data or clinical data. Only a few studies have created a single prediction model using both gene expression and clinical data. These studies often remain inconclusive regarding an obtained improvement in prediction performance. We rigorously compare three different integration strategies (early, intermediate, and l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997